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ABSTRACT 


Suppose  samples  of  size  n  are  chosen  at  random  from 
lots  of  size  N  and  examined  for  the  presence  (or  absence) 
of  a  certain  type  (or  types)  of  defect.  If  inspection  is 
not  perfect,  there  may  be  a  probability,  p,  of  failing  to 
detect  a  defect  when  it  is,  in  fact,  present.  It  is  de¬ 
sired  to  estimate  p  from  records  of  the  number  of  items 
found  to  have  the  defect (s)  on  inspection. 

In  the  absence  of  further  information,  the  problem 
cannot  be  answered  on  the  basis  of  single  samples,  of  size 

n,  from  each  of  a  number  of  lots.  Borne  further  assump¬ 
tions  are  needed  -  such  as  supposing  the  number  of  de¬ 
fective  items  per  lot  is  (nearly)  constant.  Alternatively 
'  some  more  elaborate  sampling  -  for  example,  repeated  sam¬ 
pling  from  the  same  lot  -  may  provide  the  necessary  in¬ 
formation. 


The  paper  describes  several  methods  of  estimation 
utilizing  various  forms  of  additional  data  and  informa¬ 


tion. 

cussed 


Some  as  yet  unsolved  problems  are  briefly  dis- 
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I.  Introduction 


Suppose  samples  of  size  n  are  chosen  at  random  from  lots  of  size  N  and 
examined  for  the  presence  (or  absence)  of  a  certain  type  (or  types)  of  defect. 

If  Inspection  Is  not  perfect,  there  may  be  a  probability  (1-p)  of  falling  to  de¬ 
tect  a  defect  when  It  Is,  In  fact,  present.  It  Is  desired  to  estimate  p  from  , 
records  of  the  number  of  Items  found  to  have  the  defect(s)  on  Inspection.  (We 
assume  that  the  probability  of  'false  positive'  -  declaring  a  good  item  to  be 
defective  -  Is  negligible.) 

If  sampling  Is  with  replacement.  It  Is  not  possible  to  estimate  p  on  the 
basis  of  single  samples,  of  size  n,  from  each  of  a  number  of  lots,  even  supposing 
the  actual  proportion  of  defective  Items  (6)  to  be  constant  from  lot  to  lot. 

This  Is  because  the  distribution  of  the  observed  number  of  defectives  (Z)  In  such 
a  sample  Is  binomial  with  parameters  (n,  ep)  and  it  Is  not  possible  to  disen¬ 
tangle  p  from  e. 

If  sampling  Is  without  replacement,  again  supposing  the  proportion  of  de¬ 
fectives^  (6  *  D/N  when  D  Is  the  number  of  defectives  In  the  lot)  the  distribution 
of  Z  Is  a  mixture  of  binomial  distributions  with  parameters  (Y,p)  -where  Y  -  the 

actual  number  of  defective  items  In  the  sample  has  a  hyprgeometrlc  distribution 

n 

with  parameters  (n,D,N).  Conventionally  we  write  (as  In  [1]). 

Z-~  Bln(Y.p)  a  Hypg  (n,D,N) 

Y 


From  a  set  of  values  of  m  Independent  random  variables  (Zj,  Z2,...Zm).  knowing 

n  and  N,  It  Is  possible  to  estimate  p  by  maximum  likelihood  from  the  likelihood 
function 


The  limits  of  summation  for  y  are  max(Zj,n-N+D)  <  y  <  mln(n.D).  However,  It  Is  to 
be  expected  -  In  view  of  the  fact  that  as  N  Increases  we  approved  the  situation  of 
sampling  without  replacement  -  that  no  great  accuracy  1«  estimating  p  will  be 
attainable.  A  major  part  of  difficulty  In  the  nulsafe  parameters  D(or  0»D/N).  For 
reasonable  accuracy  some  form  of  repeated  sampling  will  be  needed. 

2.  Estimates  of  p 

If  we  are  In  the  fortunate  position  of  having  a  number  of  Items  known  to  be 
defective  we  can  obtain  a  simple  estimator  (of  p )  by  testing  them  repeatedly  and 
estimating  p  by  the  proportion  of  times  a  'defective*  decision  Is  obtained.  If 
this  Is  not  the  case  at  the  beginning  of  the  Investigation  we  might,  however, 
be  In  a  position  to  exploit  the  fact  that  an  Item  declared  'defective'  at  any 
time  must  (according  to  our  assumptions)  be  defective.  Denoting  by  Nj  the  number 
of  Items  declared  defective  just  j  times  In  m  trials  (so  that  NQ  ♦  N1  +...+  Nm  «  n, 

and  If  p  ■  1,  Nq  ■  n-y,  ■  y)  a  plausible  but  specious  argument  might  run  as 
follows: 


"For  each  Item,  we  discard  the  first  'defective'  decision  and  observe 

m 

the  proportion  of  defective  decision  In  the  remaining  J  N.  sets  of 

j-1  3 

(m-1)  trials.  Since  these  are  independent,  the  total  number  of 

m 

defectives  in  the  trials  has  a  binomial  ((m-1)  T  N.,p)  distribution, 

j-1  3 

and  our  estimate  of  p  is  unbiased,  with  an  easily  compute^  standard 
deviation.  (If  Nq  ■  m,  then  no  estimation  Is  possible).* 


-  3  - 


(It  Is  not  difficult  to  see  that  this  will  produce  a  negatively  biased  estimator 
of  p,  because  In  all  the  trials  which  are  thrown  away  a  decision  of  'defective* 
Is  reached.  In  fact  the  estimator  Is 


1 

(m— 1 )  J  N. 
j-1  J 


l  (J-l)N. 
>1  i 


J 


m 


m 


1 


and  Its  expected  value  Is 


mp 

l-(l-p)m 


(10) 


He  could  take  notice  of  only  those  trials  following  the  first  'defective* 
decision;  although  this  will  not  use  all  the  Information  available.  It  does  lead 
to  simple  formulae.  The  observed  proportion  of  'defective'  decisions  Is  noj^an 
unbiased  estimator  of  p;  the  (conditional)  distribution  of  the  number  of  'defective' 
decisions  counted  Is  binomial  with,  parameters  (total  number  of  Inspections  Included 
In  count,  p). 

Suppose,  now,  that  we  have  m  repeated  Inspections  of  a  set  of  n  Items  (among 
which  an  unknown  number  y  are  defective)  and  have  been  able  to  record  the  results 
of  Individual  Inspections  (and  not  just  total  number  of  decisions  of  'defective' 
for  each  Inspection  of  the  set  of  n  Items).  The  likelihood  function  Is 


(jr-n4K0>Nj . nJ  (l-p)'"(y_rvvH0) 


(11) 


(n-NQ  £  y  £  ") 


-  4  - 


where  Nn  ■  n  -  7  N.  Is  the  number  of  Items  which  are  not  declared  defective  in 

0  j»l  J 

m 

any  of  the  m  inspections.  Note  that  (Nn,  l  jNj  is  a  sufficient  statistic  for 

u  j-1  J 

(y,p);  JjNj  is  the  total  number  of  'defective'  decisions.  If  y*"-  were  known,  the 


maximum  likelihood  estimator  of  p  would  be 


p(y)  *  (my)-1  lJNi  «  (my)”1!,  say 
ju  J 


The  corresponding  maximized  log  likelihood  would  be 


n-Vl 


log  L(y)  -  K  +  7  log  (y-1)  ♦  (my-T)  log(my-T)-my  log  my  (13) 

1-0 


where  K  does  not  depend  on  y.  We  then  seek  to  maximize  (13)  with  respect  to  y, 
subject  to  n  >  y  >  n  -  Ng.  Note  that  we  are  not  primarily  Interested  In  the  value 
of  y  Itself,  but  we  need  the  value,  y,  maximizing  (13)  to  calculate  the  maximum 
likelihood  estimator,  p(p),  of  p. 

m 

A  useful  practical  method  Is  obtained  by  noting  that  for  the  £  N.  Items  which 

j-l  J 

we  know  to  be  defective,  the  numbers  of  times  each  item  Is  declared  'defective'  can 
be  regarded  as  observed  values  of  independent  random  variables  each  having  a  binomial 
(m,p)  distribution,  truncated  by  omission  of  zero  values.  Each  such  variable  has 
expected  value  mp{l-(l-p)m}~1.  Equating  sample  means  and  expected  values  gives  the 


equation 


■PU-U-pff1  -  (n-Ng)*1! 


«  Average  number  of  times  an  article  Is  judged  defective, 
given  that  It  is  so  judged  at  least  once. 


This  estimator  Is,  In  fact,  the  conditional  maximum  likelihood  estimator  of  p,  given 


-  5  - 


The  equation  can  be  solved  iteratively,  writing  It  In  the  form 

P  •  m‘1(n-N0)‘hri-(l-P)m}  (14) 

We  note  that 

var(T)  «  mnp(l-p)e  -  m  nfn-lHN-l)'^  e{l-e) 

and 

var(p)  «  m’2  (n-N0)'2{l-(l-P)m>2  var(T) 

If  an  estimator  of  y  Is  needed,  we  note  that,  with 

P  *  P.  E[  £  H,  ■  n  -  N„|y,p]  -  y{l-(l-P)*}. 
j*l  3  0 

Replacing  expected  by  actual  values,  we  get  the  estimator 


J  »  m1n(n,  jjn-N0){Hl-p)m>"1  ])  »  m1n(n,  ^(mp)-1  J  j  )  (15) 

where  [  ]  denotes  'nearest  Integer  to1. 

As  a  numerical  example  suppose  we  test  each  of  50  («n)  Items  three  (*m)  times 
and  obtain 

Nq  «  43;  Nj  -  1;  N2  «  1;  N3  »  5 


so  that 


T  - 


1  +  2  +  15  ■  18.  Equation  (14)  gives 
p{3p  -  3p2  +  p3}*1  ■  (3  x  7)-118  «  6/7 


where 


leading  to  p  -  0.8545 


Fro,  (IS),  j)  ■  nil,  f  50.  f  16 _ 1  ]■  7. 

I  [  3  X  O.S545  ] 
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Note  that  the  same  values  of  p  and  y  would  be  obtained,  whatever  the  value 
of  n  (>  7). 

Maximum  likelihood  estimation  has  been  discussed  by  Lindsey  [2]. 
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